Fix: Incorrect use of partial in `TweedieDistribution._rowwise_gradient_hessian` #889

bili2002 · 2024-12-23T14:38:09Z

This PR addresses issue #888 by fixing the incorrect use of partial in the TweedieDistribution._rowwise_gradient_hessian function.

Checklist

Added a CHANGELOG.rst entry

stanmart

Thank you for finding and fixing this! I believe there is a similar mistake here:

glum/src/glum/_distribution.py

Line 706 in 9c0a221

f = partial(inv_gaussian_log_eta_mu_deviance, p=self.power)

Could you please address it too so that they are in the same PR?
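The thread does not spell out what exactly was wrong with the `partial` calls. One common failure mode, shown here as a minimal sketch with hypothetical stand-in functions (not glum's real ones), is binding a keyword such as `p` to a function that does not accept it. `functools.partial` does not validate keywords when it is created, so the error only appears when the partial is called:

```python
from functools import partial


def tweedie_like(y, mu, p):
    """Hypothetical stand-in for a generic Tweedie helper that takes a power p."""
    return (y - mu) / mu**p


def inv_gaussian_like(y, mu):
    """Hypothetical stand-in for a specialised helper with the power fixed at 3."""
    return (y - mu) / mu**3


# Binding p is fine for the generic helper.
ok = partial(tweedie_like, p=3)

# Binding p to the specialised helper raises nothing here ...
bad = partial(inv_gaussian_like, p=3)


def call_raises_type_error(func, *args):
    """Return True if calling func(*args) raises TypeError."""
    try:
        func(*args)
    except TypeError:
        return True
    return False


# ... the TypeError only surfaces once the partial is actually called.
print(call_raises_type_error(ok, 2.0, 1.0))
print(call_raises_type_error(bad, 2.0, 1.0))
```

Because the failure is deferred to call time, a code path like this can go unnoticed until a test actually exercises it, which is in line with the request for a test case below.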

Also, I think it'd be worth adding a test case that would have spotted these bugs (maybe here?), but I'm also okay with doing it later.

stanmart · 2024-12-23T21:11:36Z

Hm, it might be a bit more involved than that. Is it possible that inv_gaussian_log_rowwise_gradient_hessian is incorrect? Shouldn't it be

@@ -276,12 +276,12 @@ def inv_gaussian_log_rowwise_gradient_hessian(
         inv_mu = 1 / mu[i]
         inv_mu2 = inv_mu ** 2
 
-        gradient_rows_out[i] = 2 * weights[i] * (inv_mu - y[i] * inv_mu2)
-        hessian_rows_out[i] = 2 * weights[i] * (inv_mu - 2 * y[i] * inv_mu2)
+        gradient_rows_out[i] = weights[i] * (y[i] * inv_mu2 - inv_mu)
+        hessian_rows_out[i] = weights[i] * (2 * y[i] * inv_mu2 - inv_mu)

instead?
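The proposed formulas can be checked numerically. The sketch below assumes a log link (`mu = exp(eta)`), that the gradient is the derivative with respect to `eta` of the weighted log-likelihood kernel `-0.5 * w * (y - mu)**2 / (y * mu**2)` (constants and dispersion dropped), and that the "hessian" is the negative second derivative. These conventions are assumptions for illustration; the thread does not state them explicitly. The function names are hypothetical.

```python
import numpy as np


def log_likelihood_kernel(eta, y, weights):
    """Weighted inverse Gaussian log-likelihood in eta, up to constants and dispersion."""
    mu = np.exp(eta)  # log link (assumed)
    return -0.5 * weights * (y - mu) ** 2 / (y * mu**2)


def proposed_gradient(eta, y, weights):
    """Gradient from the '+' lines of the diff above."""
    inv_mu = np.exp(-eta)
    return weights * (y * inv_mu**2 - inv_mu)


def proposed_hessian(eta, y, weights):
    """Hessian from the '+' lines of the diff above (negative second derivative)."""
    inv_mu = np.exp(-eta)
    return weights * (2 * y * inv_mu**2 - inv_mu)


def old_gradient(eta, y, weights):
    """Gradient from the '-' lines of the diff above."""
    inv_mu = np.exp(-eta)
    return 2 * weights * (inv_mu - y * inv_mu**2)


y = np.array([0.5, 1.0, 2.0, 3.0])
eta = np.array([-0.5, 0.0, 0.3, 0.5])
weights = np.array([1.0, 2.0, 0.5, 1.5])
h = 1e-4  # finite-difference step

f_plus = log_likelihood_kernel(eta + h, y, weights)
f_mid = log_likelihood_kernel(eta, y, weights)
f_minus = log_likelihood_kernel(eta - h, y, weights)

# Central differences for the first and (negated) second derivative.
fd_gradient = (f_plus - f_minus) / (2 * h)
fd_neg_hessian = -(f_plus - 2 * f_mid + f_minus) / h**2

gradient_ok = np.allclose(proposed_gradient(eta, y, weights), fd_gradient, rtol=1e-5, atol=1e-6)
hessian_ok = np.allclose(proposed_hessian(eta, y, weights), fd_neg_hessian, rtol=1e-4, atol=1e-5)
old_gradient_ok = np.allclose(old_gradient(eta, y, weights), fd_gradient, rtol=1e-5, atol=1e-6)
print(gradient_ok, hessian_ok, old_gradient_ok)
```

Under these assumptions the proposed expressions agree with the finite differences, while the old gradient differs from them by a factor of -2.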

stanmart

Thank you :)

stanmart · 2025-01-08T15:02:31Z

@bili2002, do you mind if I push the fix for those formulas to this branch before merging this PR?

bili2002 · 2025-01-09T09:03:14Z

@stanmart Sure, go ahead with the formulas!

stanmart · 2025-01-09T09:22:28Z

@lbittarello, could you please check if my changes make sense?

stanmart · 2025-01-09T09:24:57Z

Also, if it's correct, I believe the relevant part of quantco/objectives needs to be updated, too.

CHANGELOG.rst

src/glum/_functions.pyx

Co-authored-by: Luca Bittarello <15511539+lbittarello@users.noreply.github.com>

lbittarello · 2025-01-09T13:03:03Z

tests/glm/test_glm.py

+    assert_allclose(glm.intercept_, glmnet_intercept, rtol=1e-3)
+    assert_allclose(glm.coef_, glmnet_coef, rtol=1e-3)


Out of curiosity, what happens if you use the full Hessian? Do the tests pass? Are the coefficients closer to glmnet's?

The glmnet test passes and the coefficients are roughly as close as in the other case. Another test (tests/glm/test_glm.py::test_glm_family_argument[inverse.gaussian-fam4]) fails with a singular matrix LinAlgError, though.

The failing test might not be an issue, though, as I believe the model was originally meant to be penalized, but it has not been since glum v3. 🤔 Anyway, I think it needs a bit more thinking and testing if we want to go with the true Hessian. I'd vote for merging this fix relatively soon, and possibly switching to the true Hessian later on if we convince ourselves that it works.
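To make the distinction between the true (observed) Hessian and the Fisher information matrix (FIM) concrete, here is a minimal sketch for the inverse Gaussian with log link. It assumes the observed row-wise Hessian is `w * (2 * y / mu**2 - 1 / mu)`, as in the diff proposed earlier in the thread, and takes its expectation using `E[y] = mu`. The function names are hypothetical, and the thread does not show the exact expression that was merged.

```python
import numpy as np


def observed_hessian(y, mu, weights):
    """Negative second derivative of the log-likelihood kernel w.r.t. eta (log link)."""
    return weights * (2 * y / mu**2 - 1 / mu)


def fisher_information(mu, weights):
    """Expectation of the observed Hessian over y, using E[y] = mu."""
    return weights / mu


mu = np.array([1.0, 1.0, 2.0])
y = np.array([0.2, 1.0, 5.0])
weights = np.ones(3)

observed = observed_hessian(y, mu, weights)
expected = fisher_information(mu, weights)

# The observed Hessian is negative whenever y < mu / 2; the FIM never is.
print(observed)
print(expected)
```

The expected version is positive for every row, whereas the observed one changes sign for small `y`. That may be related to the singular-matrix failure mentioned above, although the thread does not confirm this.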

bili2002 added 2 commits December 23, 2024 16:31

fix

2d71452

Add changelog

b0ab7e4

bili2002 requested review from MarcAntoineSchmidtQC, jtilly, lbittarello and stanmart as code owners December 23, 2024 14:38

stanmart requested changes Dec 23, 2024

fix

644cc16

stanmart approved these changes Dec 24, 2024

stanmart mentioned this pull request Jan 2, 2025

Gradient and hessian incorrect for inverse gaussian #891

Closed

Merge branch 'main' into fix-inverse-gaussian-call

00408d9

stanmart added 4 commits January 9, 2025 10:17

Add tests demonstrating the issue

36a5916

Use inv_gaussian_log_* functions when appropriate

a2dde9e

Fix inverse gaussian derivatives

797f0b1

Update changelog

27b123d

This was linked to issues Jan 9, 2025

Incorrect use of partial in function TweedieDistribution._rowwise_gradient_hessian #888

Closed

Gradient and hessian incorrect for inverse gaussian #891

Closed

lbittarello reviewed Jan 9, 2025

CHANGELOG.rst (outdated, resolved)

src/glum/_functions.pyx (resolved)

stanmart and others added 3 commits January 9, 2025 11:24

Update CHANGELOG.rst

dd2a6c3

Co-authored-by: Luca Bittarello <15511539+lbittarello@users.noreply.github.com>

Clarify why we are using the FIM

6f18433

Add test against glmnet

b67feb4

lbittarello approved these changes Jan 9, 2025

stanmart merged commit 3e483b9 into main Jan 9, 2025
24 checks passed

stanmart deleted the fix-inverse-gaussian-call branch January 9, 2025 13:59
